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BACKGROUND 



The present invention relates generally to data distribution systems, and more 
particularly, to improved methods and apparatus for providing Internet cache content 
delivery using a data distribution system. 

The assignee of the present invention has developed a data distribution system, 
and in particular, a satellite-based data distribution system, that is used to distribute data 
supplied by content providers to personal computers of clients. In order to allow clients 
to have improved response times to obtain data distributed by the data distribution 
system, an important aspect of the system was developed relating to caching services 
that addresses this need. 

Several companies, including Inktomi, Cache Flow, and NetWork Appliance, 
for example, currently make cache appliances or specific software which are targeted at 
Internet service providers. SkyCache is currently operating caching services using 
satellite communications. 

It would be an improvement to have a method and apparatus that provides 
Internet cache content delivery using a data distribution system. 



The present invention provides for a cache system and caching method that is 
used to improve data delivery service provided by a data distribution system, such as a 
satellite-based data distribution system. The present invention uses a master cache in 
which predictive harvesting of content is performed based upon probability 
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distributions of individual caches satisfying requests from their users (clients). Content 
in the master cache is distributed to user sites by way of the satellite-based data 
distribution system, and local caches at user sites are automatically loaded with 
incoming content. The use of the satellite-based data distribution system provides an 
efficient economic multiplier. 

More specifically, an exemplary system comprises a master cache that receives 
content for distribution by the data distribution system to one or more users. A 
gateway receives content distributed by the data distribution system from the master 
cache. One or more local caches store the content received by the gateway destined for 
the one or more users. Harvesting software processes information from the master 
cache and the gateway that corresponds to probability distributions that the local caches 
satisfy requests from their respective users to predictively distribute the desired content 
to the respective users. 

Content distributed by the data distribution system to the gateway is transferred 
into the local cache by creating a pseudo client on the gateway, receiving an interrupt at 
the pseudo client indicating that that content has arrived at the gateway, enabling the 
gateway as a sibling cache for the local cache, requesting content to be transferred from 
the sibling cache to the local cache, verifying that content has been transferred to the 
local cache during the transfer process, and disabling the gateway as a sibling cache of 
the local cache at the completion of the process. The local cache retrieved the content 
from the sibling cache until all content has been transferred. 

The master cache is built by processing statistics derived from the master cache 
and the local cache to produces a list of content to add to the master cache and a list*6< 
of content to delete from the master cache. A pseudo client is formed that retrieves and 
verifies the content to be added to the master cache. The verified content is then 
transmitted from the master cache to the local cache. 

Caching is used to improve the response time of a user (client) that is served by 
the cache system employed in the data distribution system. Caching also benefits 
service providers by reducing communications costs between the service environment 
and the external world. Using satellite transmission services allows cost savings and 
response times to be further enhanced for a large number of service providers by 
allowing aggregation of their individual requests. 

The use of the satellite-based data distribution system provides for efficient 
distribution of cache content over a wide geographical area thereby allows many user 
sites to simultaneously update their local caches. This reduces cost of terrestrial 
infrastructure for the individual user sites. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

The various features and advantages of the present invention may be more 
readily understood with reference to the following detailed description taken in 
conjunction with the accompanying drawings, wherein like reference numerals 
designate like structural elements, and in which: 

Fig. 1 illustrates the architecture of an exemplary caching system in accordance 
with the principles of the present invention used in conjunction with a satellite-based 
data distribution system; 

Fig. 2 illustrates exemplary apparatus and a method in accordance with the 
present invention for forcing content into a local cache; and 

Fig. 3 illustrates an exemplary method in accordance with the present invention 
for building the master cache. 

DETAILED DESCRIPTION 

Referring to the drawing figures, Fig. 1 illustrates the architecture of an 
exemplary caching system 10 in accordance with the principles of the present invention 
the invention is used in conjunction with a data distribution system 30, such as a 
satellite-based data distribution system 30 developed by the assignee of the present 
invention or an RF data distribution system, for example. An exemplary satellite-based 
data distribution system 30 comprises a network operations center 1 1 that includes 
subsystems and software that permits use of the data distribution system 30 by 
subscribers (users). This is illustrated as subscriber entitlement 12. 

The subscriber entitlement subsystem originates entitlement messages, which 
activates the entitled gateways to decode the data transmitted over the data distribution 
system 10. 

The satellite-based data distribution system 30 distributes content received from 
service providers by means of a satellite multicast feed 13 distributed by way of a 
satellite 14 to a gateway 15, referred to as a CyberStar Enterprise Gateway 15, and 
which is part of the present caching system 10. The content is typically transmitted in 
the form of http (hypertext transport protocol) objects such as web pages or nttp (News 
text transport protocol) objects such as news groups. 

Content received by the gateway 15 is transferred to a local cache 20 which is 
typically located on a user computer or server computer at a user site. For example, the 
content is stored in the local cache 20 on a server computer and is transferred to a user 
computer that is coupled to a user network 24 by way of a switch 22, for example. A 
router 23 is used to connect the user network 24 to the Internet 25. 
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The exemplary caching system 10 also includes a master cache 18 and harvest- 
ing software 17 that is typically disposed on a computer 19 at the network operations 
center 1 1 . The computer 19 is coupled to the Internet 25 and receives content from 
content providers for distribution. The computer 19 is coupled by way of a backchan- 
nel 16 to the gateway 15. The backchannel 16 may be implemented in a variety of 
ways, including a dedicated narrowband circuit (telephone), a virtual private 
narrowband channel, or an internet connection, for example. The backchannel 16 is 
used to transmit hit/miss data and probability tables from the gateway 15 to the 
computer 19, and the harvesting software 17 processes information contained in the 
transmit hit/miss data and probability tables. 

Fig. 2 illustrates exemplary apparatus 40 and a method 50 in accordance with 
the present invention for forcing content into the local cache 20. The apparatus 40 
comprises the gateway 15 which is coupled to the local cache 20 as discussed with 
reference to Fig. 1. The method 50 is implemented in the gateway 15 and processes 
content received from the satellite 14, typically in the form of a multicast satellite feed. 
The method 50 comprises the following steps. 

Received content 5 1 is forced into the local cache 20 using the method 50 as 
follows. An entitlement message is received 52 by the gateway 15 from the subscriber 
entitlement subsystem that notifies 53 a pseudo client 41 that new objects have arrived. 
The pseudo client 41 is a computer program which runs on the gateway computer and 
uses the appropriate communication protocol to communicate with the local cache. An 
example of this protocaol is ICP (Internet Cache Protocol). However, the protocol may 
also be a proprietary one from a number of cache hardware vendors. The pseudo client 
41 also has processes that allow it to control or designate the presedence relationship 
beween the local cache arid the gateway. 

The pseudo client 41 enables 54 the gateway 15 as a sibling cache 42 for the 
local cache 20. The standard Internet Protocol (IP), or Internet Cache Protocol (ICP), 
is used to communicate between the local cache 20 and the sibling cache 42. The 
pseudo client 41 then requests 55a the new objects 53 having arrived on the sibling 
cache 42. This request 55a makes use of standard control settings which force the local 
cache 20 to render any objects resident at the local cache as unable to satisfy the request 
55a. The local cache 20 then requests 55b the object from its sibling cache 42. The 
sibling cache 42 complies by sending 56 the object to the local cache 20. The local 
cache 20 in turn fowrards the object to the pseudo client 41 thus verifying 57 the object 
has been transferred to the local cache 20. The pseudo client 41 continues to request 
55a objects from the local cache 20 until all the new objects hve been transferred to the 
local cache 20. During the transfer process, the pseudo client 41 verifies 57 that all 



pages have been transferred to the local cache 20. At the completion of the process, the 
pseudo client 41 disables 58 the gateway 15 as a sibling cache 42 of the local cache 20. 

Fig. 3 illustrates an exemplary method 60 in accordance with the present 
invention for building the master cache 18. The harvest analysis software 17 processes 
statistics 61 from the master cache 18 and all local caches 20. Parameters and statistics 
used by the harvest analysis software are, for example, time to live (TTL) the number 
of times an object was requested, the number of times an object was requested and 
satisfied by local cache 20 (Hit), the number of times an object was requested and 
satisfied by a server other than the local cache 20 (miss). Objects are added/dropped 
according to high mis counts and low hit counts. The TTL parameter can also be 
adjusted to reduce the object's miss count. The statistics are collected by the local cache 
20^as^art of its normal operation. These statistics are periodically transferred to the 
^ate-way-15. 

The harvest analysis software 17 produces a list 62 of objects (corresponding to 
each item of content) to add to the master cache 18 and a list 64 of objects to delete from 
the master cache 18. A pseudo client 41a is used to retrieve 65 and verify 66 the objects 
to be added to the master cache. The master cache forwards 67 the content to the 
satellite NOC for transmission according to various distribution policies. The satellite 
NOC adds a level of conditional access onto the transmission stream to provide service 
to only those sites which have been entitled to the cache service. 

Thus, an improved method and apparatus for providing Internet cache content 
delivery using satellite transmission services have been disclosed. It is to be 
understood that the described embodiments are merely illustrative of some of the many 
specific embodiments that represent applications of the principles of the present 
invention. Clearly, numerous and other arrangements can be readily devised by those 
skilled in the art without departing from the scope of the invention. 



